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Abstract —Prostate cancer is the most diagnosed form of cancer 
in Canadian men, and is the third leading cause of cancer 
death. Despite these statistics, prognosis is relatively good with 
a sufficiently early diagnosis, making fast and reliable prostate 
cancer detection crucial. As imaging-based prostate cancer 
screening, such as magnetic resonance imaging (MRI), requires 
an experienced medical professional to extensively review the 
data and perform a diagnosis, radiomics-drlven methods help 
streamline the process and has the potential to significantly 
improve diagnostic accuracy and efficiency, and thus improving 
patient survival rates. These radlomics-driven methods currently 
rely on hand-crafted sets of quantitative imaging-based features, 
which are selected manually and can limit their ability to fully 
characterize unique prostate cancer tumour phenotype. In this 
study, we propose a novel discovery radiomics framework for gen¬ 
erating custom radiomic sequences tailored for prostate cancer 
detection. Discovery radiomics aims to uncover abstract imaging- 
based features that capture highly unique tumour traits and 
characteristics beyond what can be captured using predefined 
feature models. In this paper, we discover new custom radiomic 
sequencers for generating new prostate radiomic sequences using 
multi-parametric MRI data. We evaluated the performance of the 
discovered radiomic sequencer against a state-of-the-art hand¬ 
crafted radiomic sequencer for computer-aided prostate cancer 
detection with a feedforward neural network using real clinical 
prostate multi-parametric MRI data. Results for the discovered 
radiomic sequencer demonstrate good performance in prostate 
cancer detection and clinical decision support relative to the 
hand-crafted radiomic sequencer. The use of discovery radiomics 
shows potential for more efficient and reliable automatic prostate 
cancer detection. 

Index Terms —Discovery radiomics, radiomic sequencing, au¬ 
tomatic prostate cancer detection, multi-parametric magnetic 
resonance imaging (MP-MRI), deep convolutional network, deep 
features. 


1. Introduction 

Prostate cancer is the most diagnosed form of cancer (ex¬ 
cluding non-melanoma skin cancers) in Canadian and Ameri¬ 
can men. According to the Canadian Cancer Society [1], there 
is an estimated 24,000 new cases and 4,100 deaths from it in 
2015, making it the third most deadly cancer and accounting 
for approximately 10% of cancer deaths in Canadian men. 
Similarly in the United States, there is an estimated 220,800 
new cases and 27,540 deaths from prostate cancer in 2015, 
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making it the second most deadly cancer and accouting for 
approximately 9% of cancer deaths in American men [2]. The 
median patient survival time for metastatic prostate cancer 
is between 12.2 to 21.7 months [3]. However, prognosis is 
relatively good if the prostate cancer is detected early. As such, 
fast and reliable prostate cancer screening methods are crucial 
and can greatly impact patient survival rate, as the five-year 
survival rate in Canada is 96% for patients diagnosed with 
prostate cancer before the metastatic stage [4]. 

The current clinical model for initial prostate cancer screen¬ 
ing employs a digital rectal exam (DRE) or a prostate-specific 
antigen (PSA) test. Given a positive DRE or an elevated PSA, 
a patient undergoes a follow-up transrectal ultrasound (TRUS) 
guided multicore biopsy for risk stratification. The PSA in 
particular has recently come under scrutiny, as recent studies 
[5] [6] have demonstrated that the PSA test has a significant 
risk of overdiagnosis with an estimated 50% of screened 
men being diagnosed with prostate cancer. This oversensitivity 
leads to expensive and painful needle biopsies and subsequent 
overtreatment [5] [6] [7]. In addition, these prostate biopsies 
cause discomfort, possible sexual dysfunction, and increased 
hospital admission rates due to infectious complications while 
having a chance of the biopsy needle missing the cancerous 
tissue [8] [9]. The challenge diagnosticians currently face is 
how to improve the detection of prostate cancer by reducing 
the overdiagnosis due to conventional screening methods while 
still maintaining a high sensitivity. 

The use of magnetic resonance imaging (MRI) has recently 
grown in popularity as a non-invasive imaging-based prostate 
cancer detection method; however, a diagnosis through MRI 
requires an experienced medical professional to extensively 
review the data. Manual labelling of image data is time- 
consuming, and can lead to diagnostic inconsistencies due to 
variability between radiologists (inter-observer variability) and 
the variability of a radiologist over multiple sittings (intra¬ 
observer variability) [10] [11] [12]. To help raise the consis¬ 
tency of radiologists, the European Society of Urogenital Ra¬ 
diology (ESUR) introduced the Prostate Imaging - Reporting 
And Diagnosis System (PI-RADS) as a common set of criteria 
[13]. In addition, the use of multiple MRI modalities (multi- 
parametric MRI) has been shown to improve prostate cancer 
localization [14] via the extraction of unique information and 
features from each modality. Despite PI-RADS and further 
development to standardize diagnostic practices across multi- 
parametric MRI [15], there is still a level of subjectiveness 
in assessing MR images that can lead to inter-observer and 
intra-observer variability. 
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Automatic computer-aided prostate cancer detection or ra- 
diomics-driven methods for prostate cancer detection have 
been developed to help streamline the diagnostic process 
and increase diagnostic consistency. Radiomics refers to the 
high-throughput extraction and analysis of large amounts of 
quantitative features from medical imaging data to characterize 
tumour phenotypes, allowing for a high-dimensional mineable 
feature space that can be utilized for cancer detection and 
prognosis [16]. The prognostic power of radiomics has been 
previously shown in studies on lung and head-and-neck cancer 
patients, indicating the potential of radiomic features for 
personalized medicine and predicting patient outcomes [17] 
[18]. 

Current radiomic-driven methods for prostate cancer detec¬ 
tion typically employ a set of pre-defined, hand-crafted quan¬ 
titative features extracted from multi-parametric MR images. 
Lemaitre et al. [19] recently published a comprehensive review 
of state-of-art radiomics-driven methods for prostate cancer 
detection and diagnosis. The hand-crafted feature models 
used in these radiomics-driven methods typically included 
some combination of the following: first-order and second- 
order statistical features, Gabor filters, gradient-based features, 
fractal-based features, pharmacokinetic features, and discrete 
cosine transform (DCT) features. 

Madabhushi et al. [20] assessed the utility of combining 
multiple features for detecting prostate cancer in ex-vivo MRI 
(i.e., prostates glands obtained via radical prostatectomy). 
Three-dimensional texture feature sets were extracted from 
MRIs that had been corrected for background inhomogeneity 
and nonstandardness. The following feature sets were ex¬ 
tracted and used to train an ensemble of classifiers: first-order 
statistical features, second-order Haralick features, steerable 
Gabor filters, and gradient-based features. 

Tiwari et al. [21] proposed a method that combines struc¬ 
tural and metabolic imaging data for prostate cancer detection 
in multi-parametric MRI (including T2-weighting imaging and 
magnetic resonance spectroscopy). Using similar features as 
[20], Tiwari et al. detected cancerous regions within prostate 
tissue using a random forest classifier. 

Duda et al. [22] introduced a semi-automatic multi-image 
texture analysis for the characterization of prostate tissue using 
contrast-enhanced T1-weighted, T2-weighted, and diffusion- 
weighted imaging. The method simultaneously analysed sev¬ 
eral images (each acquired under different conditions) repre¬ 
senting the same part of the organ. In addition to the features 
used by [20], Duda et al. also extracted fractal-based and run 
length features. 

Litjens et al. [23] motivated their features with biology, 
using features that represent pharmacokinetic behaviour, sym¬ 
metry and appearance, and other anatomical aspects. Various 
features were extracted from different MR images: second- 
order statistical and Gabor features from T2-weighted images, 
multi-scale blobness from apparent diffusion coefficient maps, 
and curve fitting and pharmacokinetic features from contrast 
enhanced images. These features were used for prostate gland 
segmentation, generating a cancer likelihood map, and cancer¬ 
ous region classification. 

Ozer et al. [24] also proposed the use of pharmacokinetic 


parameters derived from contrast-enhanced MRI, combining 
it with T2-weighted and diffusion-weighted imaging. Using a 
relevance vector machine (RVM) with a Bayesian framework, 
Ozer et al. leveraged second-order statistical and (DCT) fea¬ 
tures from the peripheral zone of multi-parametric prostate 
MRI datasets to automatically segmented regions of cancer¬ 
ous tissue, and evaluated the method against support vector 
machines (SVM) with the same framework. Ozer et al. later 
extended their work to select a threshold value for increased 
segmentation performance, and further compared with a repre¬ 
sentative unsupervised segmentation method (Markov random 
field) [25]. 

Artan et al. [26] hand-crafted feature vectors using median- 
filtered intensity values extracted from axial-oblique fast spin- 
echo (FSE) T2-weighted, echo planar diffusion-weighted, 
multi-echo FSE, and contrast-enhance MR images. Using 
these features, Artan et al. developed a cost-sensitive SVM for 
automated prostate cancer localization and showed improved 
cancer localization accuracy over conventional SVMs. They 
also combined a conditional random field with the cost- 
sensitive framework that further improved prostate cancer 
localization via the incorporation of spatial information. 

Similar to [25], Liu et al. [27] introduced a method 
for unsupervised prostate cancer segmentation using fuzzy 
Markov random fields. They estimated the parameters of the 
Markovian distribution of the measured data, and applied it 
to parameter maps extracted from multi-parametric prostate 
MRI (T2-weighted MRI, quantitative T2, diffusion-weighted 
imaging, and contrast-enhanced MRI). 

Vos et al. [28] developed a fully automatic computer- 
aided detection method for prostate cancer using a supervised 
classifier in a two-stage classification approach. As prostate 
cancer can be discriminated from benign abnormalities due 
to their heterogeneity, Vos et al. analysed lesion candidates 
via the combination of a histogram analysis of T2-weighted 
axial images, pharmacokinetic maps, contrast-enhanced Tl- 
weighted, and apparent diffusion coefficient maps with texture- 
based features. 

Khalvati et al. [29] proposed a multi-parametric MRI texture 
feature model for radiomics-driven prostate cancer analysis. 
The texture feature model, based on the one proposed by Peng 
et al. [30], comprises of 19 low-level texture features extracted 
from each MRI modality, including features extracted from 
the gray-level co-occurrence matrix (GLCM). Khalvati et al. 
[31] more recently published radiomics-driven models as an 
extension of the previous texture feature model. An attempt at 
designing comprehensive quantitative feature sequences, the 
radiomics-driven models include additional MRI modalities, 
additional low-level features, and feature selection. 

While hand-crafted imaging-based feature models for 
radiomics-driven methods for prostate cancer detection have 
been shown to be highly effective, the generic nature of such 
feature models for generating radiomic sequences can limit 
their ability to fully characterize unique prostate cancer tumour 
phenotype. Motivated by this, we introduce the concept of 
discovery radiomics, where we forgo the notion of predefined 
feature models by discovering customized, tailored radiomic 
feature models directly from the wealth of medical imaging 
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Fig. 1: Overview of the proposed discovery radiomics framework for prostate cancer detection. Multi-parametric MRI data 
from past patients is used in the radiomic sequencer discovery process to discover a customized radiomic sequencer tailored for 
prostate cancer. For a new patient case, the patient’s multi-parametric MRI data is fed into the discovered radiomic sequencer 
to generate a custom, tailor-made radiomic sequence of abstract imaging-based features for tumor quantification and analysis. 


data already available. Discovery radiomics has the potential 
to find previously unexplored, abstract imaging-based features 
that capture highly unique tumour traits and characteristics 
beyond what can be captured using predefined feature models, 
thus allowing for improved personalized medicine and risk 
stratification through radiomic sequencing. 

To realize the concept of discovery radiomics, we propose 
a novel framework for discovering customized radiomic se¬ 
quencers that can generate radiomic sequences of abstract 
imaging-based features that are tailored for prostate cancer de¬ 
tection using multi-parametric MRI. In this study, we introduce 
very deep stochastic convolutional radiomic sequencers that 
are discovered using multi-parametric MRI data captured from 
past patient cases for generating custom radiomic sequences. 
The paper is structured as follows. Patient data and the 
underlying principles of the proposed discovery radiomics 
framework for prostate cancer detection are described in 
Section II. Experimental setup and comparative results are 
shown in Section III. Lastly, conclusions are drawn and future 
work is discussed in Section IV. 

II. Methods 

The proposed discovery radiomics framework for prostate 
cancer detection consists of the following steps (see Fig¬ 
ure 1). First, standardized multi-parametric MRI data from 
past patients along with pathology-verified radiologist tissue 
annotations are fed into the radiomic sequencer discovery 
process, where a customized radiomic sequencer is discov¬ 
ered for generating a large number of abstract imaging- 
based features that capture highly unique tumour traits and 


characteristics pertaining to prostate cancer. Second, for a new 
patient case, the discovered radiomic sequencer is then used 
to generate a custom, tailor-made radiomic sequence from 
the multi-parametric MRI data of the new patient case for 
comprehensive, custom quantification of the prostate cancer 
tumour phenotype. These discovered radiomic sequences can 
then be used by a classifier to provide information for clinical 
decision support for prostate cancer detection and diagnosis. 

A. Patient Data Collection 

A requirement of the proposed discovery radiomics frame¬ 
work is the presence of standardized multi-parametric MRI 
data from past patients. In this study, multi-parametric MRI 
data of 20 patients was acquired using a Philips Achieva 
3.0T machine at Sunnybrook Health Sciences Centre, Toronto, 
Ontario, Canada. Institutional research ethics board approval 
and patient informed consent for this study was obtained by 
the Research Ethics Board of Sunnybrook Health Sciences 
Centre. The patients’ ages ranged from 53 to 83. T2-weighted 
(T2-w), diffusion-weighted imaging (DWI), and correlated 
diffusion imaging (CDI) axial data was collected for each 
patient to form the multi-parametric MRI dataset. Table I 
summarizes the information regarding the acquired prostate 
multi-parametric MRI data including displayed field of view 
(DEOV), resolution, echo time (TE), and repetition time (TR). 

The multi-parametric MRI dataset also includes annotation 
information for isolating the prostate gland, and ground truth 
data for the size and location of tumour candidates. As ground 
truth, all images were reviewed and marked as healthy and 
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Fig. 2; Architecture of the very deep stochastic convolutional radiomic sequencer introduced in this study. The radiomic 
sequencer consists of 17 convolutional sequencing layers and two fully-connected sequencing layers. The weights of the 
connections are determined stochastically based on a learned distribution. 


TABLE I: Description of acquired prostate T2w, DWI, and 
CDI imaging data comprising the multi-parametric MRI 
dataset. 


Modality 

DFOV {cmD 

Resolution (mm^ 

TE (ms) 

TR (ms) 

T2w 

22 X 22 

0.49 X 0.49 X 3 

110 

4,697 

DWI 

20 X 20 

1.56 X 1.56 X 3 

61 

6,178 

CDI 

20 X 20 

1.56 X 1.56 X 3 

61 

6,178 


cancerous tissue by a radiologist with 18 and 13 years of 
experience interpreting body and prostate MRI, respectively. 
The multi-parametric MRI data and radiologist annotations 
were verified as accurate using corresponding histopathology 
data obtained through radical prostatectomy with Gleason 
scores of seven or higher. The study used the following multi- 
parametric MRI modalities; T2-weighted (T2w) imaging, ap¬ 
parent diffusion coefficient (ADC) maps, computed high-b 
diffusion-weighted imaging (CHB-DWI), and correlated dif¬ 
fusion imaging (CDI). The modalities are summarized in the 
following subsections. 

1) T2-weighted Imaging (T2w): T2-weighted (T2w) imag¬ 
ing is a MR imaging modality that characterizes the sensitivity 
of tissue using the differences in transverse (spin-spin) relax¬ 
ation time of the applied magnetic field. T2w imaging has 
been shown to provide some localization information due to a 
small reduction in signal for cancerous tissue in the prostate 
gland [14]. 

2) Diffusion-Weighted Imaging (DWI): Diffusion-weighted 
imaging (DWI) is an imaging modality in which the sensitivity 
of the tissue to the Brownian motion water molecules is 
measured through the application of lobe gradients (pairs of 
opposing magnetic field gradient pulses) [32]. The diffusion- 
weighted signal S is formulated as: 

5 = ( 1 ) 

where S'o is the signal intensity without diffusion weighting, 
b is the gradient strength and pulse duration, and D is the 
strength of the diffusion. The diffusion-weighted images are 
typically generated using different b values, and can be used 
to estimate apparent diffusion coefficient (ADC) maps via 


least-squares or maximum likelihood methods [32]. Cancerous 
tissue in ADC maps is usually presented with a darker intensity 
relative to surrounding healthy tissue [33]. 

3) Computed High-b Diffusion Weighted Imaging (CHB- 
DWI): Previous research has shown that high b-values in 
DWI data (e.g., b-values greater than 1, OOOs/mm^) allow for 
increased delineation between healthy and cancerous tissue 
[34] [35]. Due to hardware limitations, acquiring high b- 
value images for prostate imaging is infeasible. CHB-DWI is 
a computational model for reconstructing high b-value DWI 
data using low b-value acquisitions [34] [36]. Our patient 
data includes CHB-DWI images constructed at a b-value of 
2, OOOs/mTO^ using a Bayesian model with the least-squares 
estimation used to estimate our ADC maps. 

4) Correlated Diffusion Imaging (CDI): Correlated Diffu¬ 
sion Imaging (CDI) is a new diffusion MRI modality that 
leverages the joint correlation in signal attenuation across 
multiple gradient pulse strengths and timings to improve 
delineation between cancerous and healthy tissue [37]. As 
cancerous tissue generates higher intensities at high b-values, 
better delineation can be achieved by adjusting the utilized 
b-values for a given application. The overall characterization 
of the water diffusion is better represented via the correlation 
of signal attenuation across all b-values within a local sub¬ 
volume, which is obtained via signal mixing [37]: 

pbn 

I So{x)...Sn{x)P{So{x),...Sn{x)\ 

bO 

V{x) X dSo(x)...dSn{x) (2) 

where b^ represents the utilized b-values, x is the spatial 
location, S is the acquired signals, P represents the conditional 
joint probability density function, and V(x) is the local sub¬ 
volume centred at x. 


CDI(x) = J 


B. Tumour Candidate Detection 

As CDI [37] shows excellent delineation between cancerous 
and healthy prostate tissue, tumour candidates were extracted 
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from the dataset using a simple threshold on CDI data: 


CDI(x) 


mask — 


CDI{x) > 
otherwise 


(3) 


where CDI{x) refers to the CDI value at x, CDImax is the 
max CDI value, and CDImask is the binary mask of tumour 
candidates for a given MRI slice. A total of 80 cancerous 
regions and 714 healthy regions were identified as tumour 
candidates from the 20 different patients and used in this study. 


C. Radiomic Sequencer Discovery 

Given the standardized multi-parametric MRI data from 
past patients and pathology-verified radiologist tissue anno¬ 
tations, the radiomic sequencer discovery process discovers a 
customized radiomic sequencer for generating a large number 
of abstract imaging-based features that capture highly unique 
tumour traits and characteristics pertaining to prostate cancer. 
In this study, we introduce a novel very deep stochastic 
convolutional radiomic sequencer, as shown in Figure 2, for 
discovering custom-tailored radiomic sequences for prostate 
cancer detection and diagnosis. Inspired by the very deep 
convolutional network structure in [38], the proposed radiomic 
sequencer consists of 17 convolutional sequencing layers and 
two fully-connected sequencing layers of 1000 and 500 nodes, 
respectively. An important challenge with conventional very 
deep convolutional networks is that they have a large number 
of parameters to learn, necessitating an extremely large dataset 
for training. Due to the limited quantity of prostate cancer 
patient cases that can be obtained, such networks cannot be 
learned sufficiently and thus, it is difficult to use such networks 
as radiomic sequencers. We address this important challenge 
in two main ways in the proposed radiomic sequencer. 

First and foremost, we forgo the notion of having the 
weights of the connections between nodes be individual 
parameters to learn, and instead have the weights of the 
connections between nodes determined stochastically based 
on a learned distribution. As such, the number of parameters 
that must be learned is significantly reduced, making it well- 
suited as a radiomic sequencer that can be discovered with a 
reasonable amount of patient cases. Second, data augmentation 
is performed via the rotation of each tumour candidate at 45° 
intervals (i.e., 0°, 45°, 90°, 135°, 180°, 225°, 270°, 315°). 
While the proposed radiomic sequencer has significantly fewer 
parameters to learn relative to conventional networks, the 
current dataset is still insufficiently large to properly train the 
radiomic sequencer. Therefore, data augmentation is used to 
grow our existing dataset, resulting in 640 cancerous regions 
and 5,712 healthy regions (as determined by corresponding 
radiologist annotations and pathology data) that can now be 
used as tumour candidates for training the proposed radiomic 
sequencer. In this study, the proposed radiomic sequencer is 
discovered via iterative scaled conjugate gradient optimization 
using cross-entropy as the objective function. 

In the proposed radiomic sequencer, each convolutional 
sequencing layer (Figure 3) consists of stochastically realized 
receptive fields, an absolute value rectification unit (AVReU) 
to introduce non-saturating nonlinearity into the sequencer. 


and a spatial overlapping median pooling layer. Detailed 
descriptions of each part of the convolutional sequencing layer 
is described in the following subsections. 
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Fig. 3: Each convolutional sequencing layer consists of 
stochastically realized receptive fields, absolute value rectifi¬ 
cation unit (AVReU), and spatial overlapping median pooling. 


1) Stochastically Realized Receptive Fields: Unlike con¬ 
ventional convolutional networks where the weights of recep¬ 
tive field are trained via back propagation [39], the proposed 
radiomic sequencer leverages stochastically realized receptive 
fields to significantly reduce the number of parameters that 
need to be learned. The connection weights encapsulated by 
the receptive fields are abstracted into probability distributions, 
and each receptive field represents the shared weights for all 
localized nodal collections. In this study, each receptive field is 
treated as a random field, with the connection weights in the 
receptive field being spatially correlated subject to a spatial 
correlation parameter which varies the level of spatial 
correlation between connection weights within a receptive 
field. As such, rather than learning individual connection 
weights for the receptive fields, one now only needs to learn a 
single parameter ip. The proposed radiomic sequencer has 17 
convolutional sequencing layers; the number of stochastically 
realized receptive fields and receptive field size used in each 
convolutional sequencing layer is specified in Table II. 


TABLE II: Summary of number of stochastically realized 
receptive fields and receptive field size at each convolutional 
sequencing layer. 


Convolutional 
Sequencing Layer 

Number of 
Receptive Fields 

Receptive Field Size 

1, 2 

64 

5x5 

3, 4 

128 

5x5 

5, 6, 7, 8 

256 

5x5 

9, 10, 11, 12, 13, 14, 

512 

5x5 

15, 16 

17 

2000 

5x5 


2) Absolute Value Rectification Unit: Conventional con¬ 
volutional networks typically model a nodes’s output / as 
a function of its input x, such that f{x) = tanh{x) or 
f{x) = (1 -f e~^)~^. Flowever, these saturating nonlinearities 
train considerably slower than non-saturating nonlinearities 
(generally referred to as Rectified Linear Units or ReLUs) 
such as f{x) = max{Q,x). Similar to Krizhevsky et al. [39], 
we incorporate absolute value rectification units (AVReUs) 
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(/(x) = |a;|) to introduce non-saturating nonlinearity into the 
proposed radiomic sequencer. 

3) Spatial Pooling: Traditional local pooling layers in 
convolutional neural networks encapsulate the outputs of 
neighbouring nodes, and are typically done by pooling non¬ 
overlapping neighbourhoods into a single value centred at the 
location of the pooling neighbourhood [39]. Non-overlapping 
spatial pooling is usually performed to reduce computational 
complexity by indirectly decreasing the effective image size 
after each pooling operation. Since with the proposed radiomic 
sequencer the parameters are already significantly reduced, we 
instead introduce a spatial non-overlapping median pooling 
layer in each convolutional sequencing layer that preserves 
the effective image size. 

III. Results 

A. Experimental Setup 

To assess the usefulness of the proposed discovery ra- 
diomics framework for computer-aided prostate cancer de¬ 
tection, the discovered radiomic sequencer (DRS) from the 
proposed framework was compared against Peng et al.’s [30] 
and Khalvati et al.’s [31] hand-crafted radiomic sequencers for 
classifying tumour candidates as either healthy or cancerous 
using a feedforward neural network classifier with a single 
hidden layer of 100 nodes. While [30] and [31] evaluated 
the hand-crafted radiomic sequencers for voxel-based clas¬ 
sification via a linear discriminant analysis (LDA) classifier 
and support vector machine (SVM), respectively, only the 
hand-crafted radiomic sequencers themselves were included 
in this study to assess the use of hand-crafted and discovered 
radiomic sequencers. The performance of the classifier learned 
based on each radiomic sequencer was evaluated using leave- 
one-patient-out cross-validation using balanced training and 
testing data (i.e., an equal number of healthy and cancerous 
tumour candidates), and quantitatively assessed via sensitivity, 
specificity, and accuracy metrics: 

^ ... TP ^ . TN 

Sensitivity = Specmcity = 

TN P TP 

Accuracy = p 

where the performance of each method was quantified by 
the metrics’ closeness to one. TP is the number of tumour 
candidates identified as cancerous by both the classifier and 
the radiologist’s tissue segmentation, TN is the number of 
tumour candidates identified as healthy by both the classifier 
and the radiologist’s segmentation, N is the number of tumour 
candidates not in the radiologist segmented tissue (i.e., healthy 
prostate tissue), and P is the number of tumour candidates in 
the radiologist segmented tissue (i.e., cancerous tissue). 

To further analyse the effectiveness of DRS relative to Peng 
et al. [30] and Khalvati et al. [31], the Fisher criterion was 
used to assess the separability of the generated imaging-based 
features. The Fisher Criterion (FC) represents the separability 
of the generated healthy and cancerous radiomic sequences. 


The FC scores for the radiomic sequences generated for 
healthy (h) and cancerous (c) candidate regions were computed 
via the following: 

FC = ~ (4) 

-f (Tc 

where p,h and p,c represent the mean values of the healthy 
and cancerous radiomic sequences, respectively, and ah and 
CTc represent their respective standard deviations. A higher FC 
score corresponds to a higher degree of separability between 
the healthy and cancerous radiomic sequences, indicating the 
better delineation between healthy and cancerous regions using 
the generated radiomic sequences. 

B. Results 

The discovered radiomic sequencer (DRS) was evaluated 
against two state-of-the-art hand-crafted radiomic sequencers 
proposed by Peng et al. [30] and Khalvati et al. [31] for 
classifying tumour candidates as either healthy or cancerous 
using a feedforward neural network classifier with a single 
hidden layer of 100 nodes. Sensitivity, specificity, and accuracy 
were calculated via leave-one-patient-out cross-validation with 
balanced training and testing datasets (as the majority of 
the tumour candidates were healthy) to prevent the accuracy 
metric from being strongly skewed by the specificity rate. 
Table III shows the performance metrics for classification 
using the hand-crafted radiomic sequencers [30] [31] and the 
discovered radiomic sequencer. 

TABLE III: Comparison of hand-crafted radiomic sequencers 
[30] [31] with discovered radiomic sequencer (DRS) for 
tumour candidate classification. 



Sensitivity 

Specificity 

Accuracy 

Peng et al. [30] 

0.1824 

0.9005 

0.5821 

Khalvati et al. [31] 

0.3568 

0.9231 

0.6730 

DRS 

0.6400 

0.8203 

0.7365 


As shown in Table III, Khalvati et al. [31] performed better 
than Peng et al. [30] in terms of all performance metrics, 
and produced the highest specificity at 92.31%. While [31] 
has a high specificity, it is worth noting that the hand-crafted 
radiomic sequencer also has the lowest sensitivity (i.e., pro¬ 
portion of correctly identified cancerous tumour candidates) 
at 35.68%, missing almost two thirds of the cancerous candi¬ 
dates. This indicates that the hand-crafted radiomic sequencer 
generates radiomic sequences that better represent tumour 
candidates consisting of healthy tissue than tumour candidates 
consisting of cancerous tissue. 

Table III also shows that the discovered radiomic sequencer 
produced the highest sensitivity at 64.00% and accuracy at 
73.65%. In addition, the discovered radiomic sequencer has 
noticeably more consistent performance across the metrics rel¬ 
ative to both hand-crafted radiomic sequencers [30] [31], with 
specificity at 82.03%. This suggests that the custom radiomic 
sequences generated from the discovered radiomic sequencer 
are better able to represent both healthy and cancerous prostate 
tissue in a more balanced fashion, as opposed to favouring 
healthy tissue. 
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Statistical significance for each of the performance metrics 
was performed via a paired two-sided null hypothesis test (as 
shown in Table IV). The improvements in sensitivity using 
the discovered radiomic sequencer is statistically significant 
compared to both [30] and [31], with the improvements in 
accuracy using the discovered radiomic sequencer statistically 
significant compared to [30]. In addition, a separability anal¬ 
ysis was performed using the Fisher Criterion; the results of 
the separability analysis are shown in Table V. 

TABLE IV: Paired two-sided null hypothesis test results 
for statistical significance between hand-crafted radiomic se¬ 
quencers [30] [31] and discovered radiomic sequencer (DRS). 



Sensitivity 

Specificity 

Accuracy 

p-value against [30] 

0.0029 

0.0165 

0.0133 

p-value against [31] 

0.0277 

0.0056 

0.2396 


Table V shows that DRS obtained the highest FC score 
relative to Peng et al. [30] and Khalvati et al. [31]. This 
indicates that the discovered radiomic sequences produced by 
DRS has the best separability between healthy and cancerous 
radiomic sequences. These results demonstrate the potential of 
the proposed radiomics discovery framework for building cus¬ 
tom radiomic sequencers that can generate radiomic sequences 
tailored for prostate cancer characterization and detection. 

TABLE V: Comparison of hand-crafted radiomic sequencers 
[30] [31] with discovered radiomic sequencer (DRS) with 
respect to separability of extracted features. 



Fisher Criterion 

Peng et al. [30] 

0.0188 

Khalvati et al. [31] 

0.0144 

DRS 

0.0712 


IV. Conclusion 

In this study, a novel discovery radiomics framework for 
prostate cancer detection using multi-parametric MRI data was 
presented. Unlike conventional radiomics-driven methods that 
use a set of hand-crafted radiomic features, the discovered 
radiomic sequencer can generate radiomic sequences that are 
specifically tailored for quantifying and differentiating healthy 
and cancerous prostate tissue. 

The performance of the discovered radiomic sequencer was 
compared against two state-of-the-art hand-crafted radiomic 
sequencers [30] [31] via leave-one-patient-out cross-validation 
for the task of tumour candidate classification using a feed¬ 
forward neural network. While [31] produced the highest 
specificity (92.31%) and outperformed [30] with respect to 
all metrics, the Khalvati et al.’s hand-crafted radiomic se¬ 
quencer also produced the second lowest sensitivity (35.68%), 
indicating that almost two thirds of cancerous candidates are 
undetected. The discovered radiomic sequencer based on the 
proposed discovery radiomics framework, however, produced 
the highest sensitivity (64.00%) and accuracy (73.65%), and 
has noticeably more consistent performance across the metrics 
relative to [30] and [31], with specificity at 82.48%. 

Future work includes the use of additional MRI modalities 
(such as dual-stage correlated diffusion imaging [40]) to allow 


the proposed framework to discover radiomic sequencers that 
can produce more distinctive radiomic sequences for prostate 
cancer, and the investigation of transfer learning [41] (e.g., the 
application of a radiomic sequencer trained using non-prostate 
imaging data to prostate cancer data). In addition, further 
analysis of discovered sequences will be conducted to better 
understand the prostate cancer phenotype. The application of 
radiomic sequences can be extended to cancer grading and 
staging, enabling a non-invasive method for assessing the 
severity and behaviour of prostate cancer. As such, radiomic 
sequencing can lead to better patient care and survival rates 
through more reliable risk stratification. 
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